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The most challenging scenario for Kohn-Sham density functional theory, that is when the electrons 
move relatively slowly trying to avoid each other as much as possible because of their repulsion 
(strong-interaction limit), is reformulated here as an optimal transport (or mass transportation 
theory) problem, a well established field of mathematics and economics. In practice, we show that 
solving the problem of finding the minimum possible internal repulsion energy for N electrons in a 
given density p(r) is equivalent to find the optimal way of transporting N — 1 times the density p 
into itself, with cost function given by the Coulomb repulsion. We use this link to put the strong- 
interaction limit of density functional theory on firm grounds and to discuss the potential practical 
aspects of this reformulation. 



I. INTRODUCTION 

Electronic structure theory plays a fundamental role 
in many different fields: material science, chemistry and 
biochemistry, solid state physics, surface physics. Its 
goal is to solve in a reliable and computationally afford- 
able way the many-electron problem, a complex combina- 
tion of quantum mechanical and many-body effects. The 
most widely used technique, which achieves a reasonable 
compromise between accuracy and computational cost, is 
Kohn-Sham (KS) density functional theory (DFT) [TJ0. 

Optimal transport or mass transportation theory stud- 
ies the optimal transfer of masses from one location to 
another. Mass transportation theory dates back to 1781 
when Monge [3] posed the problem of finding the most 
economical way of moving soil from one area to another, 
and received a boost when Kantorovich in 1942 gener- 
alized it to what is now known as the Kantorovich dual 
problem [3]. Optimal transport problems appear in var- 
ious areas of mathematics and economics. 

In this article we show that one of the most challenging 
scenario for KS DFT, that is when the repulsion between 
the electrons largely dominates over their kinetic energy, 
can be reformulated as an optimal transport problem. 
As we shall see, the potential of this link between two 
different well-established research areas has both formal 
and practical aspects. 

It is difficult to write a paper fully accessible to two 
different communities such as mass transportation and 
electronic density functional theory. In an effort towards 
this goal we have chosen to use for both the optimal 
transport and the DFT part the most commonly used 
notation in each case, translating from one to the other 
throughout the paper. The article is organized as follows. 
We start in Sec. HI] with a review of the motivations to 
study the strong-interaction limit of DFT and the chal- 
lenges that this limit poses. Right after, in Sec. Ill we 



discuss the implications for DFT of our mass transporta- 
tion theory reformulation of this limit, anticipating the 
results that will be derived in the subsequent sections. 



This way, the first part of the paper is a self-contained 
presentation written with language that is entirely fa- 
miliar to the density functional theory community. The 
mass transportation theory problem is then introduced 
in Sec. |IV|and used in Sees. [VfVl| to address the strong- 
interaction limit of DFT and derive the results antici- 



pated in Sec. Ill Simple examples, mainly thought to 
illustrate the problem to the mass transportation theory 
community, are given in Sec. |VII| This second part of 
the paper is thus mainly written in a language familiar 
to the optimal transport reader. 



VIII 



The last Sec. 

devoted to a final discussion of the connection between 
these two different research areas and to conclusions and 
perspectives. Finally, many of the technical details are 
given in the Appendix. 



II. STRONG INTERACTIONS IN DFT 

In the formulation of Hohenberg and Kohn (HK) [T], 
electronic ground-state properties are calculated by min- 
imizing the energy functional E[p] with respect to the 
particle density p(r), 



E[p] = F[p] + / Ucxt (r)p(r)dr 



(1) 



where u e xt(r) is the external potential and F[p] is a 
universal functional of the density, defined as the ex- 
pectation value of the internal energy (kinetic energy 
T = —^J2i=i^i P ms electron-electron interaction en- 
ergy Vee = Eili EjLi+i I 1- * _ r j\~ 1 ) of the minimizing 
wave function that yields the density p(r) [5], 



F[P\ 



min(^ , |T 



V ee \V). 



(2) 



Here and throughout the paper we use Hartree atomic 
units (h = m e = ao = e = 1). 

In the standard Kohn-Sham approach [2] the mini- 
mization of E[p] in Eq. (JlJ is done under the assump- 
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tion that the kinetic energy dominates over the electron- 
electron interaction by introducing the functional T s [p], 
corresponding to the minimum of the expectation value 
of T alone over all fermionic (spin-4 particles) wave func- 
tions yielding the given p [5], 



T s [p] = min(*|r|*> 



(3) 



The functional T s [p] defines a non-interacting reference 
system with the same density of the interacting one. The 
remaining part of the exact energy functional, 



E Uxc [p] = F\p]-T a \p], 



(4) 



is approximated. Usually En xc [p] is split as -EhxcIp] = 
U[p] + E xc [p], where U[p] is the classical Hartree func- 
tional, 



U\p] 



1 f d r (dr'^m. 



(5) 



and the exchange-correlation energy E xc [p] is the crucial 
quantity that is approximated. 

The KS approach works well in many scenarios, but as 
expected, runs into difficulty where particle-particle in- 
teractions play a more prominent role. In such cases, the 
physics of the HK functional F[p] is completely different 
than the one of the Kohn-Sham non-interacting system, 
so that trying to capture the difference F[p] — T s [p] with 
an approximate functional is a daunting task. A piece 
of exact information on E xc [p] is provided by the func- 
tional V^ e [p], defined as the minimum of the expecta- 
tion value of V ee alone over all wave functions yielding 
the given density p(r), 



V^[p) = min(*|V ee |v]/ 



(6) 



The acronym "SCE" stands for "strictly correlated elec- 
trons" [B] : V'gg [p] defines a system with maximum pos- 
sible correlation between the relative electronic positions 
(in the density p), and it is the natural counterpart of the 
KS non-interacting kinetic energy T s [p] . Its relevance for 
E xc [p] increases with the importance of particle-particle 
interactions with respect to the kinetic energy [5] • For 
low-density many-particle scenarios, it has been shown 
that V r J, CE [/9] is a much better zero-order approximation 
to F[p] than T s [p] [MEE]: this defines a "SCE-DFT" al- 
ternative and complementary to standard KS DFT. In 
more general cases, the dividing line between the regime 
where the KS approach with its current approximations 
works well and the regime where a SCE-based approach 
is more suitable is a subtle issue, with many complex 
systems being not well described by neither KS nor SCE 
(see also the discussion in Ref. [11} . 

The functional V e | c E [p] also contains exact informa- 
tion on the important case of the stretching of the chem- 
ical bond [TTJ |T5j , a typical situation in which restricted 
KS DFT encounters severe problems. The relevance of 



V e e CE [p] for constructing a new generation of approxi- 
mate E xc [p] has also been pointed out very recently by 
Becke [T3] . Notice that Vj, CE [p] also enters in the deriva- 
tion of the Lieb-Oxford bound [T1HTB] , an important ex- 
act condition on E xc [p\. 

Overall, constructing the functional V^L [p] for a 
given density p(r) in an exact and efficient way has 
the potential to extensively broaden the applicability of 
DFT. Only approximations for V^, CE [/>] were available 
[19] until recently, when the mathematical structure of 
the exact V^i CE [p] has been investigated in a systematic 
way [20, 21j and exact solutions for spherically-symmetric 
densities (which have been used in the first SCE-DFT cal- 
culations [HE]) have been produced. However, a gen- 
eral reliable algorithm to construct V e g CE [p] is still lack- 
ing, and many formal aspects still need to be addressed. 
Here is where mass transportation theory can play a cru- 
cial role. Reformulating V^i [p] as an optimal transport 
problem allows to put the construction of this functional 
on firm grounds and to import algorithms from another 
well-established research field. 



III. RESULTS: AN OVERVIEW 

The problem posed by Eq. that is searching for 
the minimum possible interaction energy in a given den- 
sity, was first addressed, in an approximate way, in the 
seminal work of Seidl and coworkers [5J Later 
on, in Refs. [501 and [HI a formal solution was given 
in the following way. The admissible configurations 
of N electrons in d dimensions are restricted to a d- 
dimensional subspace £lo of the full iVd-dimensional con- 
figuration space. A generic point of ilo has the form 
R-fi ( s ) = (fi ( s ) fAf( s )) where s is a (/-dimensional 
vector that determines the position of, say, electron "1" , 
and fj(s) (i = 1, N, fi(s) = s) are the co-motion func- 
tions, which determine the position of the i-th electron in 
terms of s. The variable s itself is distributed according 
to the normalized density p(s) /N. The co-motion func- 
tions are implicit functionals of the density, determined 
by a set of differential equations that ensure the invari- 
ance of the density under the transformation s — > fi(s), 



p(fi(s))dfi(&) = p{s)ds. 



(7) 



They also satisfy group properties |20j which ensure the 
indistinguishability of the ./V electrons. The functional 
Ke CE [/°] i s then given by 



ds 



N N 



1 



N |f7 s )-f 7 (s) 

%=\ j=i+l 1 lK ' 3y ' 



(8) 



Notice that while in chemistry only the three-dimensional 
case is interesting, in physics systems with reduced effec- 
tive dimensionality (quantum dots, quantum wires, point 
contacts, etc.) play an important role. 

As we shall see in Sees. |IV|V] this way of address- 
ing the functional V^ e [p] corresponds to an attempt 
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of solving the so-called Monge problem associated to the 
constrained minimization of Eq. ([6]). In the Monge prob- 
lem, one essentially tries to transport a mass distribution 
pi(r)dr into a mass distribution p2(r)dv in the most eco- 
nomical way according to a given definition of the work 
necessary to move a unit mass from position ri to posi- 
tion r 2 . For example, one may wish to move books from 
one shelf ("shelf 1") to another ("shelf 2"), by minimizing 
the total work. The goal of solving the Monge problem 
is then to find an optimal map which assigns to every 
book in shelf 1 a unique final destination in shelf 2. In 
Sees. |IVflV] it will then become clear that the co-motion 
functions are the optimal maps of the Monge problem 
associated to V^L CE [p]. 

However, it is well known in mass transportation the- 
ory that the Monge problem is very delicate and that 
proving in general the existence of the optimal maps (or 
co-motion functions) is extremely difficult. In 1942 Kan- 
torovich proposed a relaxed formulation of the Monge 
problem, in which the goal is now to find the probability 
that, when minimizing the total cost, a mass element of 
Pi at position rj be transported at position r 2 in p 2 - As 
detailed in Sec.[Vj this formulation is actually the appro- 
priate one for the constrained minimization of Eq. ^ . 



We were then able to prove in Sec.|VIjfour theorems on 
V e g CE [p]. In the first one, the existence of a generalized 
minimizer for Eq. ^ is rigorously established. It is useful 
to remind here that the functional V^, ce [/j] corresponds 
to the A — > oo limit [7] of the traditional adiabatic 
connection of DFT [2"2Tf2"5] , in which a functional F\ [p] 
depending on a real parameter A is defined as 



F x [p] = min(tf|T + AV ee |tf). 



(9) 



If ty\[p] is the minimizer of Eq. ([9]), and if we define 

Wx[ P } = (*x[p}\Vee\*x[p})-U[p}, (10) 

we have, under mild assumptions, the well-known exact 
formula [23] for the exchange-correlation functional of KS 
DFT: 



E xc [p] 



W\[p] dX. 



(11) 



When A -> oo it can be shown that [51 [71 12TJ1 |2"T] 

lim W x [p] =V e s e CE {p}-U[p}, (12) 

X—>oo 

where U[p] is the Hartree functional of Eq. We have 
thus put the existence of this limit, which contains a piece 
of exact information that can be used to model E xc [p] 
[7J Q3HI3 [Ml H7J , on firm grounds. 

When p(r) is ground-state w-representable VA, ^xip] 
is the ground state of the hamiltonian 



Hx[p\=f + XV ee + V x [p\ 



where 



JV 



&M=$>AM(r«) 



(13) 



(14) 



is a one-body local potential that keeps the density equal 
to the physical (A = 1) p(r) V A. In Refs. (20] [21] and [9] it 
has been argued that 



,. v x[p](r) r 1, s 

hm = v SC E[p](r), 

X— yoo A 



(15) 



where "usce [p] (r) is related to the co-motion functions via 
the classical equilibrium equation [20] 



Vu S CE[/o](r) 



N 



i=2 



f*(r) 



f,-(r)l 3 ' 



(16) 



and it is the counterpart of the KS potential in the strong- 
interaction limit. In fact, we also have 



SV e S e CE [p] 
Sp(r) 



(17) 



While Eq. ( 16 ) is only valid if the co-motion functions 
(optimal maps) exist, Eq. (17) is more general. As 
we shall see in Sees. |V]|VI[ the Kantorovich problem 
can be rewritten in a useful dual formulation in which 
the so called Kantorovich potential u(r) plays a central 
role. The relation between the Kantorovich potential and 
^SCE[p](r) is simply 



u(r) = -u S CE[p](r) + C, 



(18) 



where C is a constant that appears if we want to set 
^SCE(l r l — > oo) = 0, with |r| denoting the distance from 
the center of charge of the external potential. With our 
Theorems [2]|4] we have proved that under very mild as- 
sumptions on p(r) this potential exists, it is bounded 
and it is differentiable almost everywhere, also for cases 
in which the co-motion functions do not exist, thus ad- 
dressing the v-representability problem in the strong- 
interaction (A — > oo) limit. 

Theorem Py also proves that the value of V^ CE [p] is 
exactly given by the maximum of the Kantorovich dual 
problem 



(19) 



r N N N I \ 

m f / <r)p{r)dx : 5>(r,) <J2Y1 i r ._ r .i f 

[ J i=l i=l j>i 1 1 3 1 J 



The condition ££Li w ( r *; - <u l= i L^j^^^ 
simple physical meaning: it requires that at optimal- 
ity the allowed subspace f2o of the full Nd configuration 
space be a minimum of the classical potential energy. 
This can be easily verified by rewriting this condition in 
terms of «scE[p](r) using Eq. (18): 



< Ef =1 £f 



has 



N N ^ N 

EEuT— tt + E^ceHKI^sce, (20) 



i— 1 j>i 



( = 1 



where the equality is satisfied only for configurations be- 
longing to and -Esce is the total energy in the SCE 
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limit 



liniA^oo A 1 E\, where E\ is the 



ground-state energy of ( 13 1 



Equation ( 20 ) is related to the Legendre transform for- 
mulation of Lieb [25] of the KS functionals, but it has 
the advantage of being only a maximization under lin- 
ear constraints, meaning that it can be dealt with linear 
programming techniques. 

We were not able to prove the existence of the co- 
motion functions (optimal maps) in the general case, al- 
though we have hints that, for reasonable densities, it 
migth be possible. As mentioned, this is always a deli- 
cate problem. We could only prove the existence of an 
optimal map in the special case N = 2 (Appendix [B]) . 

In the following sections we introduce the optimal 
transport problem and we give the details of the results 
anticipated here. 



IV. OPTIMAL TRANSPORT 

In 1781 Gaspard Monge [3] proposed a model to de- 
scribe the work necessary to move a mass distribution 
Pi = pi dx into a final destination p2 = p 2 dx, given the 
unitary transportation cost function c(x, y) which mea- 
sures the work to move a unit mass from x to y. The 
goal is to find a so-called optimal transportation map f 
which moves p\ into P2, i.e. such that 

Pz(S) =pi(r 1 (S)) V measurable sets S, (21) 

with minimal total transportation cost 



c{x,f(x)) dpi. 



(22) 



The measures p\ and p2, which must have equal mass 
(normalized to one for simplicity), are called marginals. 
The natural framework for this kind of problems is the 
one where X is a metric space and p\ , P2 are probabilities 
on X. However, the existence of an optimal transport 
map is a very delicate question (for a simple example, 
see Sec. VII I, even in the classical Monge case, where X 

and the cost function is the 
y\. Thus in 



is the Euclidean space 
distance between x and y, c(x,y) = \x 
1942 Kantorovich [?] proposed a relaxed formulation of 
the Monge transport problem: the goal is now to find 
a probability P{x, y) on the product space, which mini- 
mizes the relaxed transportation cost 



c(x,y) P(dx,dy) 



over all admissible probabilities P, where admissibility 
means that the projections irf P and t^P coincide with 
the marginals p\ and P2 respectively. Here the notation 
irfP means that we integrate P over all variables except 
i th . The Kantorovich problem then reads 



the "' th 



mm • 



c(x, y) P{dx, dy) : nfp = p 3 for j = 1,2}, 



where j — 1,2 denotes, respectively, the variables x and 
y. The minimizing P(dx,dy) — P(x,y)dxdy in Eq. (23 I, 



called transport plan, gives the probability that a mass 
element in x be transported in y: this is evidently more 
general than the Monge transportation map / which as- 
signs a unique destination y to each x. 

The generalization to more than two marginals is cru- 
cial for our purpose and is written as 



mm 

p 



c(x\,...,x N )P(dxx,...,dx N ) : 
-k*P = pj for j = 1, . . . , N 



}. (24) 



The analogous of the Monge problem in this case is to 
find N maps fa such that fi(x) = x, Pi(S) = Pi(fi(S)) 
for every measurable set S, and (fa, . . . , /jv) minimizes 



c(fi(xi), f N (xi))pi(dxx), 
among all maps with the same property. 

V. REFORMULATION OF V e s e CE [p] 

We can now see that the way in which Vji [p] was ad- 
dressed in Refs. HOI and HH (briefly reviewed in Sec. |III[ ) 
corresponds to an attempt of solving the Monge problem 
associated to the constrained minimization of Eq. (JsJ) , 
where the co-motion functions are the optimal maps. In- 
deed, Eq. (21) is a weak form of Eq. |7| which does not 
require / to be differentiable. 

However, as said, proving the existence of the optimal 
maps is in general a delicate problem. Moreover, the 
problem posed by Eq. Q has actually the more general 
Kantorovich form p4|. This can be seen by doing the 
following (with x € Mr): 

• identify the probability P(dxx , . . . , dxjf) with 

. . . ,x N )\ 2 dxi, . . .,dx N ; 

• set all the marginals Pi equal to the density divided 
by the number of particles N, pi = j^pdx; 

• set the cost function equal to the electron-electron 
Coulomb repulsion, 



c(x 1 ,x 2 ,...,x N ) 



N N 

EE 

1=1 j = i+l 



1 



(25) 



Thus, solving the problem of finding the minimum possi- 
ble electron-electron repulsion energy in a given density is 
equivalent to find the optimal way of transporting N — 1 
times the density p into itself, with cost function given 
by the Coulomb repulsion, in the relaxed Kantorovich 
formulation. 

What are the advantages of this reformulation? As an- 



(23) ticipated in Sec. Ill we can put many of the conjectures 
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on V e g CE [p] [niHUHnj on nrm grounds, and we can rewrite 
Eq. @ in a convenient dual form that allows to use lin- 
ear programming techniques, with the potential of giving 
access to a toolbox of algorithms already developed in a 
different, well-established, context. 



VI. THEOREMS ON V^ e CE [p] 

From the point of view of mass transportation the- 
ory, the problem of Eq. ^ poses two challenges: i) the 
cost function corresponding to the Coulomb potential, 
Eq. (25 1, is different from the usual cost considered in 
the field. In particular, it is not bounded at the origin 



and it decreases with distance, thus requiring a general- 
ized formal framework; «) the literature on the problem 
with several marginals is not very extensive (see, e.g., 
[2H1 [30]) . Nonetheless, we could prove several results. In 
what follows, we state them, relegating many technical 
details of the proofs in the Appendix. 

Theorem 1. If the cost function c is nonnegative and 
lower semicontinuous there exists an optimal probability 



Popt for the minimum problem (24) 



Proof. The proof is an application of the Prokhorov com- 
pactness theorem for measures. In fact, taken a mini- 



mizing sequence (P n ) for problem (24), since they are all 
probabilities, the sequence (P n ) is compact in the weak* 
convergence of measures, so (a subsequence of) it con- 
verges weakly* to a nonnegative measure P, and this is 
enough to obtain 



cdP < liminf / cdP, 



Then P is a good candidate for being an optimal proba- 
bility for problem ( 24 ) . To achieve the proof it remains to 



show that P is a probability and that the marginal condi- 
tion nfP — Pj is fulfilled. This is true if the convergence 
of {P n ) to P is "narrow", which by Prokhorov theorem 
amounts to show the so-called tightness condition: 



Ve > 3K compact in 



pNd 



P n {R Nd \K) <e, Vn€N. 



The tightness condition above follows easily by the fact 
that all P n sat 
(j = l,...,N). 



that all P n satisfy the marginal conditions KjPn = Pj 
I,..., A). ' □ 



Remark 1. If the marginals p\ , . . . , p^ are all equal and 
if the cost function c satisfies the symmetry condition 



c(xi , . . . , x N ) = c(x kl , . . . , x kN ) 



(26) 



for all permutations k, then the existence theorem above 
holds with P op t which satisfies the same symmetry con- 
dition. In fact, it is enough to notice that, taken a prob- 
ability P, the new probability 



P{xi 



,X N 



j x k N ), 



where k runs over all permutations of {1, ...,N}, has 
a cost less than or equal to the one of P and the same 
marginals. 

We now turn to the important dual reformulation. The 
standard dual problem in optimal transport theory is: 

Theorem 2. Let c be a lower semicontinuous and finite 
valued function, then 



min^ / c(xi, . . . ,Xjy) P(dxi, . . . ,cIxn) ■ 



tt*P = Pi for j = l,...,N 



f N f 1 

iax< / Uj dpj : u j{xj) < c(x\, . . . , xm) >■ 



Moreover, the dual maximization problem also admits a 
solution. 

Remark 2. Again, if p\ = ■ ■ ■ = p^ = p and if the cost 
function c satisfies the symmetry condition (26), then 



the dual problem admits a solution u\ = • • • = Un = u. 
In fact, if Mi, . . . , mat is an optimal solution of the dual 
problem, the function 



u(x) = ^(ui(x) H h u N {x)) 



has the same maximal dual cost, and satisfies the con- 
straint 

u(xi) + • • • + u(xn) < c(xi, . . . , Xjv). 
Therefore, in this situation the dual problem becomes 

max|7V / udp : u(xj) < c(x%, . . . , a?jy) V (27) 



An optimal function u for the dual problem ( 27 ) is called 
Kantorovich potential. 

However, the theorem above does not apply directly to 
the optimal transport problem of interest here, because 
the cost, given by Eq. (25), takes the value +00 on the 
for some i ^j}. The dual formulation then 
takes the following aspect (see for instance [5Tj). 

Theorem 3. Let c be a Borel function with values in 
[0, +00] and assume that c is p\® ■ ■ ■ (&Pn almost every- 
where finite. Assume moreover that there exists a finite 
cost transport plan P. Then there exists Borel measur- 
able dual maximizers Ui with values in [—00, +00) such 



G 



that 



min< / c(xi, . . . ,xn) P(dxi, . . . ,<1xn) 



ivjP = p 3 forj = 1,...,N 



max< / Uj (xj ) dP(dxi dxw) ■ 

U] ^ .7 = 1 



/,Uj(xj) < c(xi,...,x N ] 



The assumption that c is a Borel function is large enough 
to include continuous and lower semicontinuous functions 
(also taking the value +00), in particular the Coulomb 
potential in Eq. ( 25 ) . 



The dual form of Theorem [3] does not allow explicit 
computations since it involves a plan P which may be 
not explicitly known. To overcome this difficulty we were 



able to prove that, for the cost (251 under consideration, 



the more useful dual form (27 1 still holds 



Theorem 4. Let c be the cost ( |25| ) and assume all 
marginal measures pj coincide. Then there exists a maxi- 
mizer u for the dual problem of Theorem^ which satisfies 
the formula 



N-l 



u{x) = inf {c(ik, yi . . . , vn-i) - ^ u(yj) : yi € R d \. 

i—l 

Such Kantorovich potential u is also bounded and verifies 
the equality 



N 

E 



u(xj) dP(dx\ 



, dxjv) = N / u(x)dp{x) 



Moreover, if p = jjp(x)dx, then u is differ entiable al- 
most everywhere and Vit is locally bounded. 

In Sec. |III| we have already discussed the physical 
meaning of the Kantorovich potential u: it is an effec- 
tive single particle potential, playing the same role of the 
KS potential in the strong-interaction limit. 

The proof of Theorem [4] is discussed in Appendix "K 
We were also able to prove, as reported in Appendix B 
the existence of an optimal map ( co-motion function) f 
in the special case N = 2, in any dimension d. In the 
following section we show some explicit computations for 
simple cases. 



VII. ANALYTICAL EXAMPLES 

The purpose of this section is to illustrate the optimal 
transport reformulation of the strictly correlated electron 
problem using simple examples. Results similar to those 



reported here have been already obtained from physi- 
cal considerations in Refs. El [TTJ [20] and [TBI where so- 
lutions using chemical and physical densities have been 
presented and discussed. In a way, this section is mainly 
addressed to the mass transportation community, with 
examples of the SCE problem translated in their familiar 
language. The DFT reader can also gain insight about 
the mass transportation formulation of the SCE problem 
from these examples by comparing them with those of 
Refs. El QU [20] and M 

We first consider the radial problem for two particles 
in a given dimension d, and then the case of N particles 
in d = 1 dimension. 



A. The radial <i-dimensional case for N = 2 

Here we deal with the radial case p(x) — p{\x\) when 
the number N of particles is two. 

The mass density is transported on itself in 

an optimal way by a transport map / whose existence 
has been proved in Appendix [Bj According to the one- 
dimensional calculations of the next subsection, for ev- 
ery half-line starting from the origin the mass density 
r d_1 p(r) is transported on the opposite half-line in an 
optimal way. In other words we have 



M\x\) 



where the function a(r) can be computed by solving the 
ordinary differential equation (ODE) 



a'(r)(a(r)) d 1 p(a(r)) = -r^ 1 p(r) 
which gives 

r a(r) 



„d-l 



p(s) ds = 



1 

duJd 



„d-l 



p(s) ds 



being u>d the d- volume of the unit ball in M. d . The 
Kantorovich potential u(r) is obtained differentiating the 
dual relation u(x)+u(y) — l/\x—y\ at the optimal points, 
which gives 



u{r) 



( s + a (s))' 



ds 



+00 



{s + a(s)y 



ds 



For instance, if d — 2 and p(r) is the Gaussian function 
p(r) = ke~ kr /ir we find 



a{r) 



•-log(l - < 



— kr 



')■ 



Notice that these results were already obtained from 
physical arguments by Seidl [6] in his first paper on 
strictly correlated electrons. 

It must be also noticed that replacing the Coulomb 
repulsion l/\x — y\ by the more moderate repulsion (har- 
monic interaction) —\x — y\ 2 /2, similar calculations give, 
due to the concavity of the cost function, 



fix) 



with Kantorovich potential u(r) 
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as it was already discussed in the appendix of Ref. 1201 

B. The case N = 2 and d — 1 dimension 

We take N — 2 particles in one dimension and w 
consider the simple case 

P\{X) = P2{x) = 

and 

c(x,y) 



a if \x\ < a/2 
otherwise 



\x - y\ 

















first 






0.5 






X 


(28) 


-1.0 


-0.5 


-0.5 
-1.0 




0.5 


^ L0 



By symmetry, the goal is to send the interval [0, a/2] into 
[—a/2,0] by a transportation map / with minimal cost 



F(f) = « 



a/2 



x-f{x) 



dx. 



Since the function t i— > 1 /£ is convex on M + , by Jensen 
inequality we have 



F(f) > T 



a/2 



X — f(x) dx 



Taking into account that J^ 2 xdx — a 2 /8 and 



pa/2 p0 

/ f(x)dx = I ydy = - 

JO J -a/2 



/2 

we obtain that F{f) > a for every transport map /. 
Choosing 

f{x) = x - - 

we have F(f) = a which shows that / is optimal. The 
plot of the optimal map / on [—a/2, a/2] is shown in 
Fig. [T] This is the same optimal map used in Ref. IT51 

Similar computations can be made for different densi- 
ties p. Let us denote by n the "first half" of p and by 
T2 the "second half" ; there is no loss of generality if we 
assume that the point where p splits is the origin. In 
other words, 

r*i = p on ] — oo,0[, r 2 = p on ]0, +oo[, 



with 



/0 p+oo 
t i dx = / r 2 dx = 1/2 
-oo Jo 



The best transport map / sends r\ onto r%, so from the 
differential relation 

f(x)r 2 (f(x)) =n(x), 

taking into account that /(— oo) = 0, we find 

f(x) =R^ 1 (r 1 (x) + -^ forx<0 



FIG. 1: The optimal map / for the density of Eq. (28 1 with 
a = 2. 



where R\ and R2 are the two primitives of r\ and r 2 
respectively, vanishing at the origin. Analogously, we 
obtain 

f{x) = R^(R 2 (x)- l ^j forx>0, 

which agrees with the results of Refs. |S] and HOI For 
instance, if 

p(x) = a ~\ X \ defined in [-a, a] (29) 



we get 



f{x) = ~ (VMxJ 
\x\ \ 

plotted in Fig. [2] for a = 1. 



— x* — a ] on —a, a 




FIG. 2: The optimal map / for the density of Eq. ( |29| ) in the 
case o = l. 



Taking the Gaussian 

p(x) - {ir)- 1/2 e- x2 



(30) 



we obtain the optimal map shown in Fig. [3j 




FIG. 3: Fhe optimal map / for the Gaussian density of 
Eq. pOt). 



The case N > 3 and d — 1 dimension 



We consider the case of three particles in K, with cost 

1 1 1 

c{x,y,z) = r + r + | r. 

\x-y\ \y-z\ \z-x\ 

The transport maps formulation aims to find two maps 
/i : E -> E and / 2 : E -> E such that /fp = /fp = p 
which minimize the quantity 



1 



1 



1 



\x-fi{x)\ \h(x)-h{x)\ \h{x)-x\ 



dp(x) 



with ji — fi o f ll as it follows from the indistinguisha- 
bility of the three particles. 

The simplest case occurs when the marginal source p 
is of the form 



1^ 



i=l 

in which the optimal transport maps fx are all the per- 
mutations of the points {^i}i=i,2,3 that do not send any 
point in itself. In the case of a diffuse source p we split 
p into its three tertiles p\,p2,p^ with J pidx — 1/3 and 
we send p\ — > p2, P2 —> P3, P3 Pi through mono- 
tone transport maps. For instance, if p is the Lebesgue 
measure on the interval [0, 1] we have that the optimal 
transport map /i is 

f .t + 1/3 if x < 2/3 
Jl[X) ~ \ a? -2/3 if x > 2/3, 

and correspondingly 

h{x) = f!(x) : 



x + 2/3 if x < 1/3 
x- 1/3 if x > 1/3. 




1-0H/2W 




0.2 0.4 0.6 0.8 1.0 



FIG. 4: The optimal maps /i and f 2 = ft for N = 3 and 
p = da; on [0, 1]. 



Let us show that /i and / 2 induce an optimal plan 
P. We check the optimality by calculating an explicit 
Kantorovich potential u which satisfies, for all x, y, z: 



u(x) + u(y) + u(z) < 



1 



1 



1 



\x-y\ 



\x — z\ 



\V-z\ 



(31) 



1 



1 



and, V x, 

u{x) + u{h{x))+u{h{x)) = 



We remark that the right-hand side in equation ([32j is 
equal to 15/2. To calculate u we observe that the in- 
equality ( pl"| ) holds everywhere, then differentiating with 
respect to x we obtain at a point (x, y, z) of equality 



\x-h(x)\ \x-f 2 (x)\ 

\h(x) - f2(x)\- (32) 



u'(x) 



x-y 



x — z 



\x — y\ 3 \x — z\ 3 
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Replacing y by fi(x) and z by f2(x) we obtain 

U '(x) : 

Then we find that 
u(x) - 



ifa;e(i,|), 
-f if are (|,1]. 



(33) 



A" 



+ c 

45„ , 45 
4 4 



if are [0, I], 
if are [|,|], 

C if X e [g, l]. 



(34) 



Equation (32 1 gives c = 0. By construction, then, it 
satisfies (321) and we only need to show that it satisfies 



also (31 1 . To see this we remark that by symmetry it is 



enough to check the inequality in the set where x < y < z 
and that on this set the function 



(x,y,z) i-> 



1 



1 



1 



\x-y\ \x-z\ \y-z\ 



is convex. On the other hand by the concavity of u the 
function 

(x, y, z) i-> u(x) + u(y) + u{z) 

is concave. These two maps coincide together with their 
gradients at the point (1/6, 3/6, 5/6) and then the convex 
one has to stay above the concave. 

In the case of a possibly singular source optimal maps 
do not in general exist, and the optimal configurations 
are given by probability plans P. For instance, if 



z5>. 



with Xi ordered in an increasing way, we have that the 
optimal transport plan sends: 



3^x3 ^ 3^3 ~r U X4 



^-6 + ^5 

3 U X2 ' 3 U % 

3^3 + &x 



When N > 4 similar arguments as above can be de- 
veloped, giving transport maps f±, /j 2 , fi^ 1 that 
minimize the total cost 



c(xJ 1 (x),fi(x),...J^- 1 (x))dp(x), 



where c(xi, . . . ,xn) is given in Eq. (25). Some of these 
results were also obtained by Seidl [6], again using phys- 
ical arguments. 



VIII. CONCLUSIONS AND PERSPECTIVES 



This is already interesting per se: it allows to import and 
generalize results from one domain to the other. In par- 
ticular, with our reformulation we were able to prove im- 
mediately several results on the strong-interaction limit 
of DFT. Even more interesting, we could show that the 
problem of finding the minimum interaction energy in a 
given density can be rewritten in a convenient dual form 
consisting of a minimization under linear constraints, 
paving the way to the use of linear programming tech- 
niques to solve the strictly-correlated electron problem. 

Dual reformulations have been already proved very 
useful in the context of electronic structure calculations: 
for example, in Ref.|32]the solution of the physical hamil- 
tonian by optimizing the second-order reduced density 
matrix has been tackled with a suitable dual problem. 
The use of Legendre transform techniques for the simpli- 
fication of minimizations involving permutations in the 
many electron problem has also been stressed and ap- 
plied in Refs. [33] and [3H with very interesting results. 
All these approaches focused on the quantum mechanical 
problem, while here we deal with a special problem that 
is essentially of classical nature, but contains quantum- 
mechanical information via the electronic density. We 
know now that the optimal transport formulation is the 
right mathematical framework for the strong-interaction 
limit of density functional theory. 

It is also worth to mention that the formalism devel- 
oped here can be of interest for approaches to the many- 
electron problem which use fc-electron distribution func- 
tions (i.e., the diagonal of the fc th order reduced density 
matrix), such as those of Refs.|3S|and|3Sl In fact, in these 
approaches one usually constructs a fc-electron distribu- 
tion function /0&(ri, . . . , r^) with a given density, possibly 
minimizing the electron-electron repulsion energy. This 
would result in the same Kantorovich formulation con- 
sidered here. 

The formal and practical aspects of our new reformu- 
lation are enticing for DFT: making routinely available 
the piece of exact information contained in the strong- 
interaction limit can largely broaden its applicability, 
both by developing a "SCE DFT" [SHIE] (which uses a 
strong interacting system as a reference), and via new 
exchange-correlation functionals for standard KS DFT 
[H [11] [13] . Future work will be devoted to exploit the 
practical aspects of this reformulation. 

Note added in proof: While this article was in review, 
we become aware that a related work [37] was posted 
on arXiv. In [37] the particle-particle interaction term 
is minimized by only considering the pair density. By 
neglecting the A-representability issue, this leads to a 
two-particle problem with only one map (or co-motion 
function). 



We have shown that the strong-interaction limit of 
electronic density functional theory can be rewritten as a 
mass transportation theory problem, thus creating a link 
between two different, well established, research areas. 
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Appendix A: Proof of Theorem |4| 

For the sake of simplicity we give a sketch of the proof 
only in the case of two particles, the general case can 
be obtained in a similar way. We thus consider the case 
N = 2 in M. d with two equal marginals, p\ = p2 = p with 
p e P(R d ). The problem is then 



— dP(x,y) : nfP = Pj , for j = 1,2}, 



\x - y\ 

(Al) 

and p will be assumed absolutely continuous i.e. of the 
form p(x) dx with < p(x) and / p(x) dx = 1. 

By definition, the Kantorovich potential u is a maxi- 
mizer for the dual problem according to Theorem [3] and 
Remark [T] If we denote by P an optimal plan of trans- 
port (which exists by Theorem [T]) then in the case N = 2 
considered here u maximizes the functional 



(u(x) + u(y))dP(x,y) 



(A2) 



among all the functions which satisfies the constraint 



u(x) + u(y) < 



1 



F - y\ 



(A3) 



Under the current assumptions such a maximizer exists 
by Theorem[3]above, but u is only a Borel function which 
takes values in [— oo, +oo). Much more then is needed to 
carry on the necessary computations and we will deduce 
the needed properties. The proof will be made of several 
steps. But first, let us fix the following notation: for a 
transport plan P we denote by spt(P) the support of P, 
i.e. the smallest closed set F such that P(R d x R d \F) — 
0. 



Step 1 The first step is the following intuitive fact 
about optimal transport plans. If P op t is an optimal 
transport plan, then 

0<\x-y\ V(x,y) € spt(P opt ). 

Indeed, if by contradiction a point (x,x) <E spt(P op t) we 
may find a better transport plan P by exchanging the 
mass around (x, x) with the one around another point 
(x, y) e spt(P op t) having Z^y^x. 



Step 2 Actually something more can be said. Let P, 



Opt 



be an optimal transport plan; then for all R > there 
exists a(R) > such that 

a(R) <\x-y\ Vxe B(0, R), V(x, y) <= spt(P). 



Indeed let x € B(0,R) and (x,y) £ spt(P opt ); by the 
point above and by compactness, the diagonal and the 
support of P opt have positive distance in the set B(0, R) x 
B(Q, 2R) and we denote by f3(R) such a distance. It 
follows that 

min{/3(P),P} < \x-y\. 

We then define a(R) :— min{/3(P), R}. Moreover, we 
may choose the function a non increasing. 

Step 3 Using the second step we now prove that there 
are Kantorovich potentials which are bounded. First we 
remark that we can choose a Kantorovich potential v 
which satisfies 



v(x) = inf { -. r — v(y)\. 



(A4) 



We start with a potential u and we notice that by defi- 
nition 

u(x) < inf { - u(y)\. 

Then we can consider 

u(x) = inf { - u(y)\. 

y£R<i l\X — y\ 1 

Clearly u < u. Even if u does not satisfy the constraint, 
from the definition we get 

u(x)+u(y) = inf { 1 -u{z)}+ inf { 1 , -u(z)\, 
z£R d l\x — z\ j z£R d I \z — y\ J 

and taking y as test in the first term of the right-hand- 
side and x in the second it follows that 



u{x) + u(y) < 



F - y\ 



- u(y) - u(x), 



or equivalently if we define u(x) = 2 1 (u(x) + u(x)) 

u(x) < u(x) < u(x) and u(x) + u(y) < r. 

F - V\ 

We may now define 

v(x) — sup {w(x) : u(x) < w(x) < u(x) 
and w satisfies ( |A3| }. 



v m d i. f - y\ 

< inf 

yGR d y.\x-y\ 



The function v(x) clearly satisfies (|A3j), and if v ^ v since 

= jilA\x^y\- v(y) } 

{\x^V\- u{y) }= Hx) 
then v < v < u which contradicts the maximality of v. 



Finally, v maximizes the cost ( A2 ) since u < v 
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Step 4 As anticipated in Theorem 111 if v is a Kan- Estimates ( A5 1 and ( A7 ) give a bound from above on 



torovich potential which satisfies ( A4 ) , men there exists 



a costant C such that |t>| < C. Let P opt be an optimal 
plan of transport. The condition 



(v(x) + v(y)) dP opt (x, y) = 



together with the condition 



dP opt (x,y) 

\x-y\ 



v(x) + v(y) < 



f - y\ 



implies that 



v{x) + v(y) 



for P opt -a.e. x and y, 



I I - opt~ 

\x -y\ 

and then in particular v is finite p-a.e. Moreover, setting 

G = \ x : — oo < v(x) and By s.t. v(x)+v(y) = -\ 

I \x-y\i 

it follows from the discussion above that p(G) — 1. Let 
x £ G be a point of density 1 for G and let a and r be 
such that a > r and 

1. for all s < r we have \B(x,s) H G|/|B(z,s)| > 3/4, 

2. for all x € B(x,r) if £ spt(P) then a < 

\x - y\. 

Setting L — v(x) we have that for every z £ M. d \B(x, r/4) 



v(z) = inf I -. r — v(y)\ 



< L. 

r 



(A5) 



On the other hand for every z £ B(x,r/4) n G there 
exists y such that a < \z — y\, (z,y) € spt(P opt ) and 
v(z) + v(y) = l/\z — y\. Then 

viz) = inf { ; — v(y)X 

v ' yen" \ \y- z\ Kin ) 



A\y L z-\- v{v) } =v{z) 



< inf 

\y-z\>a K \y 

since r < a, R d \B(z, a) C R d \ B(x, r/4), and then from 
the estimate ( A5 ) 



viz) = inf {-. r — v(y)\ 

v ; \ v -z\>a\ y-z yy >) 



>L--. (A6) 
r 



To get a control of v from above in B(x, r/A) we observe 
that if A < rA~ x l d and 2 G B(x, r/4), since 

|B(z,A)|= W(i A d <^V, 

then there exists at least one y z € GP\B(x, r/4) \ B(z, A) 
such that, from the estimate (A6), 



v{z) < inf ( 1 1 1 - V (V)} 
\y-z\>\ l\y - z\ i 



< v ^— ] -y{y z )<\ + --L. (A7) 

\Vx -z\ A r 



v by a constant K. The estimate from below is now 
straightforward since 



vix) = inf < - 



1 



>(y)} 



> -K. 



Step 5 The previous steps permit to gain more regu- 
larity on the potential v. Let indeed v be a Kantorovich 
potential which satisfies (A4); we show that v is differ- 



entiable almost everywhere. To see this we consider the 
family of functions 

v n (x)= inf ( 1 -v(y)\. 

a(n)<\x-y\ I \X - y\ > 

Since a is nonincreasing we have 

v„+i(x) < v n (x). 

Moreover each v n is a Lipschitz function of Lipschitz con- 
stant l/a 2 (n). By Step 2 for x e G if \x\ < m < n then 
v(x) = v n (x) — v m (x). Then on G the potential v coin- 
cide locally with a Lipschitz function which is well known 
to be differentiable almost everywhere. 



Appendix B: Proof of the existence of an optimal 
transport map for N = 2 

Once the existence of an a.e. differentiable Kantorovich 
potential v is established, we may consider the problem 
of showing the existence of an optimal transport map 
(co-motion function) f. In the case N = 2 the proof 
can be achieved by using the basic idea of differentiating 
inequality ( A3 ) at the points of equality. 



Let P op t be an optimal transport plan and let G be 
defined as above. If Gn ■— G n 5(0, iV) we prove that 
for almost every x £ Gn there exists only one y such 
that (x,y) £ spt(P opt ) and we give an explicit expression 
for such y. It follows that P op t is induced by an optimal 
transport map. Let ujv be the function defined above; 
since vjq coincides with v on Gn, for every x £ Gn and 
y the inequality 



v N (x) + v(y) < 



1 



V 



holds. Since P op t is an optimal transport plan and p = 
a(x)dx, then for P opt -&.e. (x,y) £ spt(P op t), x belongs 
to Gn for a suitable N, x is a density point for Gn and 
vn is differentiable at x. Since for z £ Gn 



v N {z) < 



1 



v(y) 



and equality holds for z = x then if we differentiate the 
functions vn and ip(z) = uz^i — v(y) we obtain 



Vvjv(x) 



x-y 
\x - j/| 3 
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from which it follows 
V = x + 



\Vv N (x)\V* 



Vv N (x) 



(Bl) 



From equation ( Bl I we deduce that for P opt -&.e. (x,y) 
the point y is uniquely determined by x and this con- 
cludes the proof of the existence of an optimal transport 



map /, by defining 



f(x) = x 



whenever x £ B(0, N). 



\Vv N (x)\^ 



Vv N (x) 
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